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Abstract 

We introduce a problem that is a common generalization of the uncapacitated facility location (UFL) 
and minimum latency (ML) problems, where facilities not only need to be opened to serve clients, but also 
need to be sequentially activated before they can provide service. This abstracts a setting where inventory 
demanded by customers needs to be stocked or replenished at facilities from a depot or warehouse. 
Formally, we are given a set T of n facilities with facility-opening costs {/,}, a set D of m clients, and 
connection costs {c^ } specifying the cost of assigning a client j to a facility i, a root node r denoting the 
depot, and a time metric d on T U {r}. Our goal is to open a subset F of facilities, find a path P starting 
at r and spanning F to activate the open facilities, and connecting each client j to a facility <p(j) e F, 
so as to minimize J^ieF /» + ^2jev( c <f>(j),j + ^j), wnere tj is tne t ime taken to reach <f>(j) along path 
P. We call this the minimum latency uncapacitated facility location (MLUFL) problem. 

Our main result is an 0(logn maxjlogn, log m}) -approximation for MLUFL. Via a reduction to the 
group Steiner tree (GST) problem, we show this result is tight in the sense that any improvement in the 
approximation guarantee for MLUFL, implies an improvement in the (currently known) approximation 
factor for GST. We obtain significantly improved constant approximation guarantees for two natural 
special cases of the problem: (a) related MLUFL, where the connection costs form a metric that is a 
scalar multiple of the time metric; (b) metric uniform MLUFL, where we have metric connection costs 
and the time-metric is uniform. Our LP-based methods are fairly versatile and are easily adapted with 
minor changes to yield approximation guarantees for MLUFL (and ML) in various more general settings, 
such as (i) the setting where the latency-cost of a client is a function (of bounded growth) of the delay 
faced by the facility to which it is connected; and (ii) the -route version, where we can dispatch k 
vehicles in parallel to activate the open facilities. 

Our LP-based understanding of MLUFL also offers some LP-based insights into ML. We obtain two 
natural LP-relaxations for ML with constant integrality gap, which we believe shed new light upon the 
problem and offer a promising direction for obtaining improvements for ML. 

1 Introduction 



Facility location and vehicle routing problems are two broad classes of combinatorial optimization problems 
that have been widely studied in the Operations Research community (see, e.g., 112511321 ). and have a wide 
range of applications. Both problems can be described in terms of an underlying set of clients that need to be 
serviced. In facility location problems, there is a candidate set of facilities that provide service, and the goal 
is to open some facilities and connect each client to an open facility so as to minimize some combination 
of the facility-opening and client-connection costs. Vehicle routing problems consider the setting where a 
vehicle (delivery-man or repairman) provides service, and the goal is to plan a route that visits (and hence 
services) the clients as quickly as possible. Two common objectives considered are: (i) minimize the total 
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length of the vehicle's route, giving rise to the traveling salesman problem (TSP), and (ii) (adopting a client- 
oriented approach) minimize the sum of the client delays, giving rise to minimum latency (ML) problems. 

These two classes of problems have mostly been considered separately. However, various logistics 
problems involve both facility-location and vehicle-routing components. For example, consider the follow- 
ing oft-cited prototypical example of a facility location problem: a company wants to determine where to 
open its retail outlets so as to serve its customers effectively. Now, inventory at the outlets needs to be 
replenished or ordered (e.g., from a depot); naturally, a customer cannot be served by an outlet unless the 
outlet has the inventory demanded by it, and delays incurred in procuring inventory might adversely impact 
customers. Hence, it makes sense for the company to also keep in mind the latencies faced by the customers 
while making its decisions about where to open outlets, how to connect customers to open outlets, and in 
what order to replenish the open outlets, thereby adding a vehicle-routing component to the problem. 

We propose a mathematical model that is a common generalization of the uncapacitated facility location 
(UFL) and minimum latency (ML) problems, and abstracts a setting (such as above) where facilities need to 
be "activated' before they can provide service. Formally, as in UFL, we have a set T of n facilities, and a 
set V of m clients. Opening facility i incurs & facility-opening cost /j, and assigning a client j to a facility 
i incurs connection cost cij. Taking a lead from minimum latency problems, we model activation delays as 
follows. We have a root (depot) node r, and a time metric d on T U {r}. A feasible solution specifies a 
subset F C Jof facilities to open, a path P starting at r and spanning F along which the open facilities are 
activated, and assigns each client j to an open facility <f>{j) € F. The cost of such a solution is 



where tj = dp(r,(p(j)) is the time taken to reach facility <fi(j) along path P. We refer to tj as client j's 
latency cost. The goal is to find a solution with minimum total cost. We call this the minimum-latency 
uncapacitated facility location (MLUFL) problem. 

Apart from being a natural problem of interest, we find MLUFL appealing since it generalizes, or is 
closely-related to, various diverse problems of interest (in addition to UFL and ML); our work yields new 
insights on some of these problems, most notably ML (see "Our results"). One such problem, which captures 
much of the combinatorial core of MLUFL is what we call the minimum group latency (MGL) problem. Here, 
we are given an undirected graph with metric edge weights {d e }, subsets {Gj} of vertices called groups, 
and a root r; the goal is to find a path starting at r that minimizes the sum of the cover times of the groups, 
where the cover time of Gj is the first time at which some % E Gj is visited on the path. Observe that MGL 
can be cast as MLUFL with zero facility costs (where T = node-set \ {r}), where for each group Gj, we 
create a client j with Cy = if i € Gj and oo otherwise. Note that we may assume that the groups are 
disjoint (by creating multiple co-located copies of a node), in which case these QjS form a metric. MGL 
itself captures various other problems. Clearly, when each Gj is a singleton, we obtain the minimum latency 
problem. Also, given a set-cover instance, if we consider a graph whose nodes are (r and) the sets, create a 
group Gj for each element j consisting of the sets containing it, and consider the uniform metric, then this 
MGL problem is simply the min-sum set cover (MSSC) problem |[T6l . 

Our results and techniques. Our main result is an O (log n maxjlog m, log n}) -approximation algorithm 
for MLUFL (Section |27TT >. which for the special case of MGL, implies an 0(log 2 n) approximation. Com- 
plementing this result, we prove (Theorem 12.91 ) that a ^-approximation algorithm (even) for MGL yields 
an 0{p\og m)-approximation algorithm for the group Steiner tree (GST) problem ifTTl on n nodes and 
m groups. Thus, any improvement in our approximation ratio for MLUFL would yield a corresponding 
improvement of GST, whose approximation ratio has remained at 0(log 2 n log to) for a decade iMTl . More- 
over, combined with the result of ll22l on the inapproximability of GST, this shows that MGL, and hence 
MLUFL with metric connection costs cannot be approximated to better than a £l(log m)-factor unless NP C 
ZTIME{n^ l °^). 
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Given the above hardness result, we investigate certain well-motivated special cases of MLUFL and 
obtain significantly improved performance guarantees. In Section 12.21 we consider the case where the 
connection costs form a metric, which is a scalar multiple of the d-metric (i.e., d uv = c uv /M, where 
M > 1; the problem is trivial if M < 1). For example, in a supply-chain logistics problem, this models a 
natural setting where the connection of clients to facilities, and the activation of facilities both proceed along 
the same transportation network. We obtain a constant-factor approximation algorithm for this problem. 

In Section 12.31 we consider the uniform MLUFL problem, which is the special case where the time- 
metric is uniform. Uniform MLUFL already generalizes MSSC (and also UFL). For uniform MLUFL with 
metric connection costs (i.e., metric uniform MLUFL), we devise a 10.78-approximation algorithm. (Without 
metricity, the problem becomes set-cover hard, and we obtain a simple matching 0(log m)-approximation.) 
The chief novelty here lies in the technique used to obtain this result. We give a simple generic reduction 
(Theorem 12. 121 ) that shows how to reduce the metric uniform MLUFL problem with facility costs to one 
without facility costs, in conjunction with an algorithm for UFL. This reduction is surprisingly robust and 
versatile and has other applications. For example, the same reduction yields an 0(l)-approximation for 
metric uniform k-median (i.e., metric uniform MLUFL where at most k facilities may be opened), and the 
same ideas lead to improved guarantees for the fc-median versions of connected facility location OTTl . and 
facility location with service installation costs ll28l . 

We obtain our approximation bounds by rounding the optimal solution to a suitable linear-programming 
(LP) relaxation of the problem. This is interesting since we are not aware of any previous LP-based methods 
to attack ML (as a whole). In Section[3l we leverage this to obtain some interesting insights about ML, which 
we believe cast new light on the problem. In particular, we present two LP-relaxations for ML, and prove that 
these have (small) constant integrality gap. Our first LP is a specialization of our LP-relaxation for MLUFL. 
Interestingly, the integrality-gap bound for this LP relies only on the fact that the natural LP relaxation for 
TSP has constant integrality gap (i.e., a p-integrality gap for the natural TSP LP relaxation translates to an 
O(p) -integrality gap). In contrast, the various known algorithms for ML El [TO] [TJ all utilize algorithms for 
the arguably harder /c-MST problem or its variants. Our second LP has exponentially-many variables, one 
for every path (or tree) of a given length bound, and the separation oracle for the dual problem is a rooted path 
(or tree) orienteering problem: given rewards on the nodes and metric edge costs, find a (simple) path rooted 
at r of length at most B that gathers maximum reward. We prove that even a bicriteria approximation for 
the orienteering problem yields an approximation for ML while losing a constant factor. This connection 
between orienteering and ML is known lfl4l . But we feel that our alternate proof, where the orienteering 
problem appears as the separation oracle required to solve the dual LP, offers a more illuminating explanation 
of the relation between the approximability of the two problems. (In fact, the same relationship also holds 
between MGL and "group orienteering.") 

We believe that the use of LPs opens up ML to new venues of attack. A good LP-relaxation is beneficial 
because it yields a concrete, tractable lower bound and handle on the integer optimum, which one can 
exploit to design algorithms (a point repeatedly driven home in the field of approximation algorithms). Also, 
LP-based techniques tend to be fairly versatile and can be adapted to handle more general variants of the 
problem (more on this below). Our LP-rounding algorithms exploit various ideas developed for scheduling 
and facility-location problems (e.g., a-points) and polyhedral insights for TSP, which suggests that the 
wealth of LP-based machinery developed for these problems can be leveraged to obtain improvements for 
ML. We suspect that our LP-relaxations are in fact better than what we have accounted for, and consider 
them to be a promising direction for making progress on ML. 

Section|4]showcases the flexibility afforded by our LP-based techniques, by showing that our algorithms 
and analyses extend with little effort to handle various generalizations of MLUFL (and hence, ML). For exam- 
ple, consider the setting where the latency-cost of a client j is A (time taken to reach the facility serving j), 
for some non-decreasing function A(.). When A is convex and has "growth" at most p (i.e., X(cx) < c p X(x)), 
we derive an 0(max{(p log 2 n) p ,p log n log m}) -approximation for MLUFL, an O (2°^) -approximation 
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for related MLUFL and ML, and an 0(l)-approximation for metric uniform MLUFL. (Concave As are even 
easier to handle.) This in turn leads to approximation guarantees for the £ p -norm generalization of MLUFL, 
where instead of the sum (i.e., £i-norm) of client latencies, the £ p -norm of the client latencies appears in 
the objective function. The spectrum of C p norms tradeoff efficiency with fairness, making the £ p -norm 
problem an appealing problem to consider. We obtain an 0(plognmax{logn, log m}) -approximation for 
MLUFL, and an 0(l)-approximation for the other special cases. Another notable extension is the /c-route 
version of the problem, where we may use k paths starting at r to traverse the open facilities. With one sim- 
ple modification to our LPs (and algorithms), all our approximation guarantees translate to this setting. As a 
corollary, we obtain a constant-factor approximation for the £ p -norm version of the k-traveling repairmen 
problem fffl (the A; -route version of ML). 

Related work. To the best of our knowledge, MLUFL and MGL are new problems that have not been 
studied previously. There is a great deal of literature on facility location and vehicle routing problems (see, 
e.g., |[25l l32l ) in general, and UFL and ML, in particular, which are special cases of our problem, and we 
limit ourselves to a sampling of some of the relevant results. The first constant approximation guarantee 
for UFL was obtained by Shmoys, Tardos, and Aardal |[29l via an LP-rounding algorithm, and the current 
state-of-the-art is 1.5-approximation algorithm due to Byrka (H. The minimum latency (ML) problem seems 
to have been first introduced to the computer science community by Blum et al. 0, who gave a constant- 
factor approximation algorithm for it. Goemans and Kleinberg |fl9l improved the approximation factor to 
10.78, using a "tour-concatenation" lemma, which has formed a component of all subsequent algorithms 
and improvements for ML. The current-best approximation factor for ML is 3.59 due to Chaudhuri, Godfrey, 
Rao and Talwar iTTOl . As mentioned earlier, MGL with a uniform time metric captures the min-sum set 
cover (MSSC) problem. This problem was introduced by Feige, Lovasz and Tetali 11161 . who gave a 4- 
approximation algorithm for the problem and a matching inapproximability result. Recently, Azar et al. 
introduced a generalization of MSSC, for which Bansal et al. obtained a constant-factor approximation. 

If instead of adding up the latency cost of clients, we include the maximum latency cost of a client in 
the objective function of MLUFL, then we obtain the min-max versions of MGL and MLUFL, which have 
been studied previously. The min-max version of MGL is equivalent to a "path-variant" of GST: we seek a 
path starting at r of minimum total length that covers every group. Garg, Konjevod, and Ravi ifTTl devised 
an LP-rounding based 0(log 2 n log m)-approximation for GST, where n is the number of nodes and m 
is the number of groups. Charikar et al. j9j gave a deterministic algorithm with the same guarantee, and 
Halperin and Krauthgamer ll22l proved an ft(log 2 m) -inapproximability result. The rounding technique 
of IfTTl and the deterministic tree-embedding construction of [9] (rather its improvement by lfl5l ) are key 
ingredients of our algorithm for (general) MLUFL. The min-max version of MLUFL can be viewed a path- 
variant of connected facility location Il27ll20l . The connected facility location problem, in its full generality, 
is essentially equivalent to GST ll27lk however, if the connection costs form a metric, and the time- and the 
connection-cost metrics are scalar multiples of each other, then various constant-factor approximations are 
known fllBEDEl- 

Very recently, we have learnt that, independent of, and concurrent with, our work, Gupta et al. lf2TTl also 
propose the minimum group latency (MGL) problem (which they arrive at in the course of solving a different 
problem), and obtain results similar to ours for MGL. They also obtain an 0(log 2 n)-approximation for MGL, 
and the reduction from GST to MGL with a log m-factor loss (see also ll26ll ). and relate the approximability 
of the MGL and "group orienteering" problems. Their techniques are combinatorial and not LP-based, and 
it is not clear how these can be extended to handle facility-opening costs. 
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2 LP-rounding approximation algorithms for MLUFL 



We can express MLUFL as an integer program and relax the integrality constraints to obtain a linear program 
as follows. We may assume that d„' is integral for alii, i' € T U {r}. Let E denote the edge-set of the com- 
plete graph on T U {r} and let d max := max eG £ d e . Let T < min{n, m}(f max be a known upper bound on 
the maximum activation time of an open facility in an optimal solution. For every facility i, client j, and time 
t < T, we have a variable y^t indicating if facility i is opened at time t or not, and a variable Xijj indicating 
whether client j connects to facility % at time t. Also, for every edge eeB and time t, we introduce a vari- 
able z e j which denotes if edge e has been traversed by time t. Throughout, we use i to index the facilities in 
F, j to index the clients in V, t to index the time units in [T] := {1, . . . , T}, and e to index the edges in E. 

min ^ fiVi,t + ( Ci -? + t ) Xi 3' t ^ 

i,t j,i,t 

s.t. ^ x ij,t > 1 for all j; x ijjt < y i>t for all i, j, t 

i,t 

y~] d e z ej ( < i for all t (2) 

e 

^ 2 e ,t > ^ Xi 3 - |t ' for all t, 5 C (3) 

e&6(S) ieS,t'<t 

Xij,t,Ui,t,z e ,t >0 for alH, j',t,e; Ui,t = for alH, t with d ir > t. 

The first two constraints encode that each client is connected to some facility at some time, and that if a 
client is connected to a facility i at time t, then i must be open at time t. Constraint §2} ensures that by time 
t no more than t "distance" is covered by the tour on facilities, and (f3]) ensures that if a client is connected 
to i by time t, then the tour must have visited i by time t. We assume for now that T = poly(m), and show 
later how to remove this assumption (Lemma 12771 Theorem 12- 8b - Thus, ® can be solved efficiently since 
one can efficiently separate over the constraints (f3]). Let (x, y, z) be an optimal solution to (0, and OPT 
denote its objective value. For a client j, define C* = Y^i t c ij x ij,t, and L* = ^ t txijj- We devise various 
approximation algorithms for MLUFL by rounding (x, y, z) to an integer solution. 

In Section |2~T1 we give a polylogarithmic approximation algorithm for (general) MLUFL (where the CjjS 
need not even form a metric). Complementing this result, we prove (Theorem 12.91 ) that a p-approximation 
algorithm (not necessarily LP-based) for MLUFL yields an O(plog m) -approximation algorithm for the GST 
problem on n nodes and m groups. In Sections I2T21 and 1231 we obtain significantly-improved approximation 
guarantees for various well-motivated special cases of MLUFL. Section l2T2l obtains a constant-factor approx- 
imation algorithm in the natural setting where the connection costs form a metric that is a scalar multiple 
of the time-metric. Section [23] considers the setting where the time-metric is the uniform metric. Our main 
result here is a constant-factor approximation for metric connection costs, which is obtained via a rather 
versatile reduction of this uniform MLUFL problem to UFL and uniform MLUFL with zero-facility costs. 

2.1 An O (log n • max{log n, log m}) -approximation algorithm 

We first give an overview. Let Nj = {i G T : Cij < AC*} be the set of facilities "close" to j, and define tj 
as the earliest time t such that J2 ieN . t r <t Xijj> > |. By Markov's inequality, we have J2 ieN . J2 t Xij t t > | 
and tj < 12L*. It is easiest to describe the algorithm assuming first that the time-metric d is a tree metric. 
Our algorithm runs in phases, with phase i corresponding to time tf = 2 . In each phase, we compute a 
random subtree rooted at r of "low" cost such that for every client j with Tj < te, with constant probability, 
this tree contains a facility in Nj. To compute this tree, we utilize the rounding procedure of Garg-Konjevod- 
Ravi (GKR) for the group Steiner tree (GST) problem ifTTl (see Lemma [24b . by creating a group for each 
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client j with Tj < tg comprising of, roughly speaking, the facilities in Nj. We open all the facilities included 
in the subtree, and obtain a tour via the standard trick of doubling all edges and performing an Eulerian tour 
with possible shortcutting. The overall tour is a concatenation of all the tours obtained in the various phases. 
For each client j, we consider the first tree that contains a facility from Nj (which must therefore be open), 
and connect j to such a facility. 

Given the result for tree metrics, an oft-used idea to handle the case when d is not a tree metric is to 
approximate it by a distribution of tree metrics with O(log n) distortion [15]. Our use of this idea is however 
more subtle than the typical applications of probabilistic tree embeddings. Instead of moving to a distribution 
over tree metrics up front, in each phase I, we use the results of (9l [151 to deterministically obtain a tree Te 
with edge weights {dj- t (e)}, such that the resulting tree metric dominates d and E e =(« i') d % (h i')z e ,t e = 
O(logn) ^2 e d e z e j r As we show in Section |4j this deterministic choice allows to extend our algorithm 
and analysis effortlessly to the setting where the latency-cost in the objective function is measured by a 
more general function (e.g., the £ p -norm) of the client-latencies. The algorithm is described in detail as 
Algorithm [B and utilizes the following results. Let r max = max.,- Tj. 

Theorem 2.1 (Il9l ll51 ) Given any edge weights {$ e }e£E> one can deterministically construct a weighted 
tree T having leaf-set T U {r}, leading to a tree metric, dj-(-), such that, for any £ T U {r}, we have: 

(i) d T (i,i') > d iti f, and(ii) J2e=(i,i')eE d r{h i')li,i> = O(logn) £ e d e ^ e . 

Theorem 2.2 ( 11171 ) Consider a tree T rooted at r with n leaves, subsets G\,..., G p of leaves, and frac- 
tional values 3 e on the edges ofT satisfying $(5(S)) > Uj for every group Gj and node-set S such that 
Gj C 5, where Vj £ [o> !]• There exists a randomized poly time algorithm, henceforth called the GKR 
algorithm, that returns a rooted subtree T" C T such that (i) Pr[e £ T"] < lefor every edge e £ TV and 

(ii) Pi[T" n Gj = 0] < ^(-^g—) for every group Gj. 



Algorithm 1 Given: a fractional solution (x, y, z) to (0 (with C* , L*,Nj, and Tj defined as above for each client j). 

Al. In each phase £ = 0, 1, . . . , N := [log 2 (2"r max ) + 41og 2 m~\, we do the following. Let tt = min{2 , T}. 

ALL Use Theorem l2.1l with edge weights {z e ^ tl } to obtain a tree Ti = (V(Te), E(Te))- Extend Ti to a tree T/ by 
adding a dummy leaf edge (i, Vi) of cost fi to Ti for each facility i. Let E' = {(i, Vj) : i £ J-}. 

A1.2. Map the LP-assignment {z e ^ tl } e ^E to an assignment 3 on the edges of 77 by setting 3 e = 

Eeiies on the unique i-i' path in % Z H' M for all e £ E(T e ), and 3e = £t<t 4 W,t for all e = (i,Vi) E E'. Note 

mat E eG _B(r f ) d Ti ( e )te = Ee=(i,i') d Ti {h i')Ze,t e = 0(\og n) ^ e d e Z e>U = 0(log Tl)t e . 

A 1.3. Define Df = {j : Tj < ti}. For each client j £ Di, we define the group iVj = {vi : i £ Nj}. We 
now compute a subtree T[ of Tg as follows. We obtain N := log 2 m subtrees T", . . . , T^. Each tree T" 
is obtained by executing the GKR algorithm 1921og 2 n times on the tree 77 with groups {N'f\j^£, t , and 
taking the union of all the subtrees returned. Note that we may assume that i £ T" iff (i, Vi) £ T". Set 
T' t to be any tree in {T{\ T^} satisfying (i) Yl(i, Vi )eE(T^) < 40 ' 192 lo g2 n Y,(i, Vi )eE' fitw and (ii) 
J2e£E(T{,)\E' d T e ( e ) < 40 • 192 log 2 n J2eeE(Ti) d 7i { e )le\ if no such tree exists, the algorithm fails. 

A1.4. Now remove all the dummy edges from T' t , open all the facilities in the resulting tree, and convert the 
resulting tree into a tour T0U17 traversing all the opened facilies. For every unconnected client j, we connect 
j to a facility in Nj if some such facility is open (and hence part of Tour^). 

A2. Return the concatenation of the tours Tou for I = 0, 1 , . . . , M shortcutting whenever possible. This induces an 
ordering of the open facilities. If some client is left unconnected, we say that the algorithm has failed. 



Analysis. The algorithm may fail in steps A. 1.3 and A2. Lemmas [2.4l and l231 bound the failure probability 
in each case by 1/ poly(m). To bound the expected cost conditioned on success, it suffices to bound the 
expectation of the random variable that equals the cost incurred if the algorithm succeeds, and is otherwise. 
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Since each client j is connected to a facility in TV,-, the total connection cost is at most 4 V C*. To 

•* — 3 3 

bound the remaining components of the cost, we first show that in any phase I, the 3 -assignment defined 
above "covers" each group in {N'Aj & r >l to an extent of at least | (Claim l2~3l) . Next, we show in Lemma [2~4l 
that for every client j E Dp, the probability that a facility in Nj is included in the tree T[, and hence opened 
in phase i, is at least §'§ — §• The facility-cost incurred in a phase is 0(logn)Y^,i t fiUi,t, and since 
r max < T = poly(m), the number of phases is 0(log m), so this bounds the facility-opening cost incurred. 
Also, since the probability that j is not connected (to a facility in Nj) in phase £ decreases geometrically 
(at a rate less than 1/2) with I when ti > Tj, one can argue that (a) with very high probability (i.e., 
1 — 1/ poly(m)), each client j is connected to some facility in Nj, and (b) the expected latency-cost of j is 
at most O(logn) E e e£(T,) d T e (e)de = 0(log 2 n)Tj. 

Claim 2.3 Consider any phase I. For any subset S of nodes of the corresponding tree T[ with r ^ S, and 
any Nj C S where j G Dt, we have $(5(S)) > ^2 ieN . t<te Xy,* > 2/3 (where 5(S) denotes 5^(S)). 

Proof : Let R = S n V(%), and Y C T be the set of leaves in R. Then 5{S) = S Te (R) U (6(S) n E'). 
Let S E (Y) = {(t,i') G E : \{i,i'}\ D Y\ = 1}. Observe that }{5 Te {R)) > £ eeMy) z eM . This is simply 
because if we send zn> t t flow along the unique (i, i') path in Ti for every (i, i') G #e(Y"), then we obtain 
a flow between Y and T U {r} \ Y" respecting the capacities {$e}e&E(T e )> an ^ of value equal to the RHS 
above. Thus, the inequality follows because the capacity of any cut containing Y must be at least the value 
of the flow. Since (x, y, z) satisfies (f3]), we further have that Yle&s E (Y) z e,ti — YlieY,t<t e x ijJ- $° 

eeS T JR) (i,Vi)e8(S):i£Yt<t t i£Y,t<t e ieNj\Y,t<t t i€Nj,t<t e 

■ 

Lemma 2.4 In any phase I, with probability 1 — 1/ poly(m), we obtain the desired tree Ti in step A1.3. 
Moreover, Pr[T^ n N'j ^ 0] > 5/9 for all j G D t . 

Proof : Consider any tree T" obtained by executing the GKR algorithm 192 log 2 n times and taking the 
union of the resulting subtrees. For brevity, we denote vAeElT") fi Dv F(T"), and J2e&E(T")\E> d Te ( e ) 
by dt(T?). 

For j G Di, let denote the event that T" n N'j is non-empty. By Theorem 12.21 we have Pr[EJ] > 
1 - exp(- 641 ^ gan • 192 log n) > 1 - e~ 3 ^ > (1 - e~ 3 )vj > 11/18. We also have that E [F(T")] < 

192 log 2 n J2(i,v,)&E' fifrw and E [di( T ")] < 192 lo §2 n J2e€E(T e ) d T t (e)$ e - Let F *' and ° r denote respec- 
tively the events that F(T") < 40-192 log 2 n Y,{i, Vi )eE> /<3w and d ti T ") < 4 °- 192 lo S2 ™ £ee£(T,) (e)3e- 
By Markov's inequality each event happens with probability at least 39/40. Thus, for any j G Di, we get 
that 

Pr[E^|(F r A D r )] > Pr[E^ A F r A D r ] > 1 - (7/18 + 1/40 + 1/40) > 5/9. (4) 

Now the probability that (F r A D r ) c happens for all r = 1, . . . , N is at most (2/40)^ < 1/m 4 . Hence, 
with high probability, there is some tree T[ := T r such that both F r and D r hold, and (01) shows that 
Pr[r; n N'j + 0] > 5/9 for all j G D e . ■ 

Lemma 2.5 The probability that a client j is not connected by the algorithm is at most 1 /m 4 . Let Lj be 
the random variable equal to j 's latency-cost if the algorithm succeeds and otherwise. Then E \_Lj\ = 
0(log 2 n)te 3 , where lj (= [log 2 Tj~\) is the smallest i such that tg > Tj. 
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Proof : Let Pj be the random variable denoting the phase in which j gets connected; let Pj := M + 1 if 

j remains unconnected. We have Pr[Pj > £] < (|J J for I > £j The algorithm proceeds for at least 
4 log 2 m phases after phase ^ , so Pr [j is not connected after M phases] < 1 / m 4 . Now, 

£j < d(Tour^) < 2 d Tf (e) = O(logn) ^ £ dr ^ e ) 3e = °0°g 2n ) D 

«<Pj €<P,- e£E(T£)\E' &<Pj e£E(Te) l<Pj 

so E [L 3 ] = 0(log 2 n) Pr[Pj > 4 • < 0(log 2 n) jS* + ^ t t ■ (j) = 0(log 2 n)t ly . 

Theorem 2.6 Algorithm\I}succeeds with probability 1 — 1/ poly(rei), and returns a solution of expected cost 
0(logn • maxjlogn, logrn}) • OPT. 

Proof : Lemmas [2 .41 and 1231 show that the failure probability is 1/ poly (m). Let Y denote the cost incurred 
if the algorithm succeeds, and otherwise. Since < 2tj = 0(L*) for each j, we have E[Y] = 
O (log n ■ maxjlog n, log m}) • OPT by Lemma 1231 and the preceding arguments. ■ 



Removing the assumption T = poly(m). We first argue that although dPj) has a pseudopolynomial 
number of variables, one can compute a near-optimal solution to it in polynomial time (Lemma [2/7]) by con- 
sidering only (integer) time- values that are powers of (1 + e) (rouhgly speaking). Given e > 0, define T r = 
[(1 + e) r ], and let TS := {To, Ti, . . . , T^} where k is the smallest integer such that T& > min{n, m}d max . 
Define T_x = 0. Let ® jg denote (0 when t ranges over TS. 

Lemma 2.7 For any e > 0, we can obtain a solution to (0 of cost at most (1+e) OPT in time poly (input size, 1 / e). 

Proof : We prove that the optimal value of ®js is at most (1 + e)OPT. Since the size of ®js is 
poly (input size, 1/e) this proves the lemma. 

We transform (x, y, z), an optimal solution to © to a feasible solution (x', y', z') to ®js °f cost at 
most (1+e) OPT. (In fact, the facility-opening and connection-costs remain unchanged, and the latency-cost 
blows up by a (l+e)-factor.) z' is simply a restriction of z to the times in TS, that is, z' et = z e t for each e, t G 
TS. Set 

x 'ij,i = x ij,i' y'i,i ~ Vh 1 f° r a1 ^ * and 3- F° r eacri i — 1, • • • , k, facility i, client j, we set x' i j Te — 

ESt m+ i% and y'i,T t = ESth+iW,*- 11 is clear that EieTS^t = Et^j,t and EteTSVlt = 
J2 t Vi,t for all i and j, and moreover for any T e G TS, we have J2teTS-.t<T e x ij,t = Et<T f x ij,t- lt fol - 
lows that (x',y',z') is a feasible solution to ® TS and T,i,teTS fiVi,t = T,i,tfiVi,t, T,j,i,teTS c ij x 'ij,t = 
J2j it c ij x ij,t- To bound the latency cost, note that for any t > Tg_i, we have T^ < (1 + e)t, so 
for any client j and facility i, Y.teTS tx 'ij,t ^ x ij,i + i 1 + e )J2t>i tx ij,t < 0- + e )Y.t tx ij,t- Thus » 
Ej,i,t G TS tx u',t < (! + e ) Ej,i,t ■ 
Let (x', y', z') denote an optimal solution to © TS . The only changes to Algorithm [T] are in the definition 
of the time ti and the number of phases M. (Of course we now work with (x',y',z'), and C*, L*- are 
defined in terms of (x', y', z') now.) The idea is to define tg so that one can "reach" the Tj of every client j 
in 0(log m) phases; thus, one can terminate in 0(log m) phases and thereby obtain the same approximation 
on the facility-opening cost. Let L = L*)/m = (Ylj i teTS ^ x 'ij t)/ m - F° r x < T&, define TS(x) to be 
the earliest time in TS that is at least x; if x > T^, define TS(x) := Tfc. Note that TS(x) < (1 + e)x for all 
x > 0. We now define tf = TS(L ■ 2 £ ), and set the number of phases to M := [log 2 (2r max /L) + 4 log 2 m\ . 
Note that since Tj = 0(L*), we have M = 0(log m). 

Theorem 2.8 For any e > 0, Algorithfn\J}with the above modifications succeeds with high probability and 
returns a solution of expected cost O (log n maxjlog re, log m}) (1 + e) OPT. 
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Proof : The analysis of the facility-opening and connection-cost is exactly as before (since N = 0(log to)). 
Define lj as the smallest I such that t% > Tj. Note that lj < [log 2 (2rj/L)J (this holds even when Tj < 
L). Hence, the probability that j is not connected after M phases is at most 1/to 4 . So as before, the 
failure probability is at most 1/ poly(m). We have Yle<e- ^ — (-*- e )-^Sf<£ ^ — 2(1 + e)^, and 
^£>£- ^(1) = 0(t£.). Thus, the inequalities involving Lj and E[Lj] in Lemma 1231 are still valid, 
and we obtain the same bound on E[Lj] as in Lemma 1231 Note that tg. < 2(1 + e)Tj = 0(L*) when 
Tj > L. Thus, £ j E [Lj] < 0(log 2 n) [to • L + £ . ;r . >to 0(L*)] = 0(log 2 n)L*. ■ 

Inappproximability of MLUFL. We argue that any improvement in the guarantee obtained in Theorem l2.6l 
would yield an improvement in the approximation factor for GST. We reduce GST to MGL, the special case 
of MLUFL mentioned in Section [Q where we have groups Gj C T and the goal is to order the the facilities 
so as to minimize the sum of the covering times of the groups. (Note that Theorem 12 . 6 l implies an 0(log 2 n)- 
approximation for MGL.) Recall that we may assume that the groups in MGL are disjoint, in which case the 
connection costs form a metric. 

Theorem 2.9 Given a p nm -approximation algorithm for MGL with (at most) n nodes and m groups, we 
can obtain an 0(p n ^ m \ogm)-approximation algorithm for GST with n nodes and m groups. Thus, the 
polylogarithmic inapproximability of GST K22\l implies that MGL, and hence MLUFL even with metric con- 
nection costs, cannot be approximated to a factor better than f2(logm), even when the time-metric arises 
from a hierarchically well-separated tree, unless NP C ZTIME (n polylog ( n )). 

Our proof of the above theorem is LP-based and is deferred to Appendix |A] Gupta et al. |2T ] indepen- 
dently arrived at the above theorem via a combinatorial proof. 

2.2 MLUFL with related metrics 

Here, we consider the MLUFL problem when the facilities, clients, and the root r are located in a common 
metric space that defines the connection-cost metric (on JUPU {r}), and we have d uv = c uv /M for all 
u, v € TUV[j{r}. We call this problem, related MLUFL, and design an 0(l)-approximation algorithm for it. 

The algorithm follows a similar outline as Algorithm [TJ As before, we build the tour on the open 
facilities by concatenating tours obtained by "Eulerifying" trees rooted at r of geometrically increasing 
length. At a high level, the improvement in the approximation arises because one can now obtain these 
trees without resorting to Theorems 12. II and 12.21 and losing 0(log n)-factors in process. Instead, since 
the d- and c- metrics are related, we can obtain a group Steiner tree on the relevant groups by using a 
Steiner tree algorithm (in a manner similar to the LP-rounding algorithms in ll27l l20l ). We now define 
Nj = {i '■ J2t x ij,t > 0> °ij ^ 3C*}, and Tj = 6L*. Ideally, in each phase I, we want to connect the iVj 
groups for all j such that Tj < ti := 2 . But to obtain a low-cost solution, we do a facility-location-style 
clustering of the set of clients with Tj < tg and build a tree Ti that connects the NjS of only the cluster 
centers: we contract these NjS and build an MST (in the <i-metric) on them, and then connect each Nj 
internally (in the <i-metric) using intracluster edges incident on j. Here we crucially exploit the fact that the 
d- and c-metrics are related. 

Deciding which facilities to open is tricky because groups Nj and N^ created in different phases could 
overlap, and we could have C* <C but Tj ^> r^; so if we open i E Nj and use this to also serve k, then 
we must connect i to some Te (without increasing its d-cost by much) where t(_ = Oir^). We consider the 
collection C of cluster centers created in all the phases, and pick a maximal subset C C C that yields disjoint 
clusters by greedily considering clusters in increasing C* order. We open the cheapest facility in Nj for all 
j G C, and attach it to the tree Te, where I is the earliest phase such that there is some cluster N^ created 
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in that phase that was removed (from C) when Nj was included in C (because Nf. n Nj ^ 0). Since d and 
c are related, one can bound the resulting increase in the d-cost of T- Finally, we convert these augmented 
7^-trees to tours and concatenate these tours. 

We now describe the algorithm in detail. We have not sought to optimize the approximation ratio. When 
we refer to an edge or a node below, we mean an edge or node of the complete graph on JUPU {r}. Recall 

that Nj = {i : J2t x ij,t > 0, < 3C*}, and Tj = 6L*. So £ ieJVj J2t x v,t > §, and EieiV^t^ x ^ ^ 
1 

2' 

Rl. For each time t t = 2 e , where £ = 0, 1, ... , [logT], we do the following. Define Dp = {j : Tj < 
fy} \ {Uo<e'<e Ce') (where the union of an empty collection is 0). 

• (Clustering) We cluster the facilities in [jj eDe Nj as follows. We pick j G with smallest C* value 
and form a cluster around j consisting of the facilities in Nj. For every client k G (including j) 
such that Cjk < 30C^ (note that C£ > Cj), we remove k from Dp, set a(k) = j, and recurse on 
the remaining clients in until no client is left in D^. Let Cg denote the set of cluster centers (i.e., 
{j G D t : a(J) = j}). Note that for two clients j and j' in C £ , Nj n Nj> is 0. 

• (Building a group Steiner tree Te on {Nj}j & c e ) We contract the clusters Nj for j G into supern- 
odes, and build a minimum spanning tree (MST) T" connecting r and these supernodes. Next, we 
uncontract the supernodes, and for each j € Ce, we add edges joining j to every facility i € Nj that 
has an edge incident to it in TJ'. This yields the tree Te- 

R2. (Opening facilities) Let C = [JgCe. (Note that a client appears in at most one of the Ct sets.) We 
cannot open a facility in every cluster centered around a client in C, since for j G Q and k G C#, Nj 
and Nk need not be disjoint. So we select a subset C C C such that for any two j, k in C , the sets Nj 
and Nj, are disjoint. This is done as follows. We initialize C ^— 0. Pick the client j G C with smallest 
C* value and add it to C . We delete from C every client k G C (including j) such that N k n JVj- / 0, 
setting nbr(fc) = j, and recurse on the remaining set of clients until no client is left in C. 

Consider each j G C. We open the facility i G Nj with smallest /j. Let £ be the smallest index such 
that there is some k G Cg with nbr(fc) = j. We connect i to Te by adding the facility edge (i, k) to it. 
Let 7/ denote 7^ augmented by all such facility edges. 

R3. We obtain a tour connecting all the open facilities, by converting each tree T[ into a tour, and concate- 
nating the tours for £ = 0, ... , [T] (in that order). 

R4. For every client j G C, we assign j to the facility opened from N nbr ^y For every client j G' C, we 
assign j to the same facility as a(j). 

Theorem 2.10 For related MLUFL, one can round (x, y, z) to get a solution with facility-opening cost at 
most I t fiHij, where each client j incurs connection-cost at most 39Cj and latency-cost at most 64tj = 
384L*. Thus, we obtain an 0{\)-approximation algorithm for related MLUFL. 

Proof : The clusters Nj for clients j G C are disjoint; each such cluster has facility weight X^igTV Et Vi,t ^ 
|, and we open the cheapest facility in the cluster. 

Consider a client j, and let it be assigned to facility i. If j G C, then i G Nj, and we have Qj < 3Cj\ 
If j G C \ C with nbr(j) = k, then i G N k and there is some facility i' G Nj n A^fe. So Cij < c ik + 
Q'fc + Ci'j < 2 • 2>Cl + 3C* < 9C*. Finally, if j £ C and f = a(j), then a(j') is also assigned to i, so 
cji < + Cjj, < 9C*, + 30C* < 39C*. 

We next bound d(Te) and d(7^') for any phase £. For any client j G -t^ and any node-set S ^ Nj, r ^ S, 
we have £ ee a(s) ^ > EieS,t< tl x ij,t > Y,i^N ] ,t<r ] x ii,t > \- Therefore, (2ze,t t ) forms a fractional 
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Steiner tree of cost at most 2tp on the supernodes and r, and hence, d(T") < 4i^ since it is well known that 
the cost of an MST is at most twice the cost of a fractional solution to the Steiner-tree LP. For j 6 Ce, let 
degj denote the degree of the cluster Nj in 7j'. Observe that if e is an edge of T" joining Nj and (so 
j, k 6 Ci), then c e > 24max{C*, C* k }, since 30max{C*, C* k } < c jk < 3C* + c e + 3C£. So the (d-) cost 
of adding the additional edges to T" is at most jj ■ Yljec t d e Sj '3C 1 * < d(T")/4, and hence, <i(7i) < 5^. 

Now consider the cost of adding facility edges to 7~e in step R2. For each facility edge (i, k) added, we 
know that i G N n b r (k), k G Ce, and k is assigned to i. So we have cik < 9C£. Observe that each client 
k G Ci is responsible for at most one such facility edge. So the <i-cost of these facility edges is at most 
h ■ E jeC , 9C* < ^ E ieC , deg,- -9C* < | • d(7?')- Thus, d{T[) < d(T e ) + f ■ dffl) < 8t e . 

Finally, we prove that the latency cost of any client j is at most 64-Tj = 384L;- . Let j be assigned 
to facility i. Let £ be the smallest index such that j G Di, so ^ < 2rj. We first argue that if i is part 
of the tree 7],, then £' < I. If j G C, this follows since we know that i G N nbr ^ and £' = min{r : 
3k G C r with nbr(fc) = nbr(j)}. If j £ C, then we know that a(j) G Q is also assigned to i, and 
so by the preceding argument, we again have that £' < £. Thus, the latency-cost of j is bounded by 

Er=0 2d(7?) < 16 Y.r=0 tr < 32t ?' < 64T i" ■ 

2.3 MLUFL with a uniform time-metric 

We now consider the special case of MLUFL, referred to as uniform MLUFL, where the time-metric d is 
uniform, that is, da> = 1 for all G T U {r}. When the connection costs form a metric, we call it the 
metric uniform MLUFL. We consider the following simpler LP-relaxation of the problem, where the time t 
now ranges from 1 to n. 

min ^ fiVi,t + ^(cjj + t)xjj,t subject to (Unif-P) 

i,t j,i,t 

^ Xjj,t > 1 Vj; Xi^ t <Vi,t Vi,j,t; y~)yi,t < 1 Vt; x ijit ,y ijt >0 Vi,j,t. 

i,t i 

Let be an optimal solution to (lUnif-PI ). and OPT be its value. Let C* = ^2n c ij x ij,t, L*- = 

J2itt x ij,t- As stated in the introduction, uniform MLUFL generalizes: (i) set cover, when the facility and 
connection costs are arbitrary; (ii) MSSC, when the facility costs are zero (ZFC MLUFL); and (hi) metric 
UFL, when the connection costs form a metric. We obtain approximation bounds for uniform MLUFL, ZFC 
MLUFL, and metric MLUFL (Theorems 12.1 ll and l2.14b that complement these observations. 

The main result of this section is Theorem 12. 121 which shows that a p(jFi_- a PP rox i ma ti° n algorithm for 
UFL and a 7-approximation algorithm for ZFC MLUFL (with metric connection costs) can be combined to 
yield a (pufl + 27)-approximation algorithm for metric uniform MLUFL. Taking /3ufl = 1-5 111 and 7 = 9 
(part (ii) of Theorem 12.1 II ). we obtain a 19.5-approximation algorithm. We improve this to 10.773 by using 
a more refined version of Theorem 12.121 which capitalizes on the asymmetric approximation bounds that 
one can obtain for different portions of the total cost in UFL and ZFC MLUFL. 

We note that by considering each (i, t) as a facility, since the connection costs Cy + 1 form a metric, one 
can view metric MLUFL as a variant of metric UFL, and use the ideas in Q to devise an 0(l)-approximation 
for this variant. We instead present our alternate algorithm based on the reduction in Theorem l2.12l because 
this reduction is quite robust and versatile. In particular, it allows us to: (a) handle certain extensions of 
the problem, e.g., the setting where we have non-uniform latency costs (see Section @]), for which the above 
reduction fails since we do not necessarily obtain metric connection costs, and (b) devise algorithms for the 
uniform latency versions of other facility location problems, where the cost of a facility does not depend 
on the client-set assigned to it (so one can assign a client to any open facility freely without affecting the 
facility costs). For instance, consider uniform MLUFL with the restriction that at most k facilities may be 
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opened: our technique yields a (pkMed + 27) approximation for this problem, using a pfcMed- a PP rox i ma ti° n 
for fc-median. 

Theorem 2.11 One can obtain: 

(i) an O(mm)- approximation algorithm for uniform MLUFL with arbitrary facility- and connection- costs. 

(ii) a solution of cost at most J2j C* + - |~— ] Ylj Lj f or ZFCMLUFL, for any parameter a € (0, 1). 
Thus, setting a = |, yields solution of cost at most 9 • OPT. 

We defer the proof of Theorem 12.1 II to the end of the section, and focus first on detailing the aforemen- 
tioned reduction. 

Theorem 2.12 Given a p\jf [^-approximation algorithm A\for UFL, and a ^-approximation algorithm A2 
for uniform ZFC MLUFL, one can obtain a (p\jf\_+2^/)-approximation algorithm for metric uniform MLUFL. 

Proof : Let X denote the metric uniform MLUFL instance, and O* denote the cost of an optimal integer 
solution. Let Xjfl be the UFL instance obtained form X by ignoring the latency costs, and Zzfc be the ZFC 
MLUFL instance obtained from X by setting all facility costs to zero. Let O ufl and 0^ fC denote respectively 
the cost of the optimal (integer) solutions to these two instances. Clearly, we have UFL , O^fc — O* • We 
use A\ to obtain a near-optimal solution to Xufl' l et F\ be the set of facilities opened and let <r\{j) denote 
the facility in F\ to which client j is assigned. So we have ^2 ieFl fi + Ylj c <r\{j)j — /°ufl ■ 0^ FL . We use A2 
to obtain a near-optimal solution to Xzfo let F2 be the set of open facilities, 02 (j) be the facility to which 
client j is assigned, and 7r(z) be the position of facility i. So we have . { c a 2 (j)j + 7r ( cr 2(j))) < 7 ■ Ozfc- 
We now combine these solutions as follows. For each facility i € F%, let p(i) € F\ denote the facility 
in F\ that is nearest to i. We open the set F = {p(i) : i € F2} of facilities. The position of facility 
i G F is set to minj/ e ^ 2:7r (j/) = j Each facility in F is assigned a distinct position this way, but some 

positions may be vacant. Clearly we can always convert the above into a proper ordering of F where each 
facility i G F occurs at position k{i) < minj/g^.^jA—j 7r(i'). Finally, we assign each client j to the 
facility 4>{j) = M°2(j)) G F. Note that < 7r(o"2(j)) (by definition). For a client j, we now have 

C^j)j < Ca 2 (j)n(a 2 (j)) + C a 2 (j)j < C a 2 (j)ai(j) + C <? 2 (j)j ^ C °l(j)j + 2c <r 2 {j)j- ThuS > the t0tal COSt ° f the 

resulting solution is at most J2ieFj /« + Y2j{ c cn(j)j 

+ 2c U2{j)j + Tr(a 2 (j))) < (pufl + 2 7 ) • O*. m 

We call an algorithm a (pj, p c ) -approximation algorithm for UFL if given an LP-solution to UFL with 
facility- and connection- costs F* and C* respectively, it returns a solution of cost at most pfF* + p c C*. 
Similarly, we say that an algorithm is a (7 C , 7^) -approximation algorithm for uniform ZFC MLUFL if given 
a solution to (0 with connection- and latency- costs C* and L* respectively, it returns a solution of cost at 
most 7 C C* + 7/L*. The proof of Theorem 12 . 1 2 1 easily yields the following more general result. 

Corollary 2.13 One can combine a (pf, p c )-approximation algorithm for UFL, and a (jc, ^^-approximation 
algorithm for uniform ZFC MLUFL, to obtain a solution of cost at most max{/?j, p c + 27 c , 7^} • OPT. 

Proof : The proof mimics the proof of Theorem 12. 121 The only new observation is that (x, y) yields an 
LP-solution to (i) Xufl with facility cost ^ t fiUij and connection cost J2j i t c ij x ij,t'> an d (ii) Xzfc with 
connection cost Ylj % t c ij x ij,t an d latency cost J2j i t ^ x ij,t- Thus, applying the construction in the proof of 
Theorem 12. 121 yields a solution of total cost at most 

Pf z~2 f iVi t + ( pc + 27c ) Ci i Xi h l + ^ l Z~2 tXi it - max {/°/> Pc + 2 7c 11} ■ OPT. 

i,t j,i,t j,i,t ■ 

Combining the ( ln []/g , j=r) -approximation algorithm for UFL |[29l with the ( j^, ^ |~^] ) -approximation 
algorithm for ZFC MLUFL (part (ii) of Theorem 12.1 II ) gives the following result. 
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Theorem 2.14 For any a, (3 € (0, 1), one can obtain a solution of cost max{ i^/g , j^rp + rr^i a \a \ } ' 
OPT. Thus, taking a = 0.7426, /3 = 0.000021, we obtain a 10. 773-approxbnation algorithm. 

2.3.1 Proof of Theorem GUI 

The following lemma will often come in handy. 

Lemma 2.15 Let (x, y) be a solution satisfying ^ y^t < k for every time t, where k > is an integer, and 
all the other constraints of (lUnif-PI) . Then, one can obtain a feasible solution (V, y') to (lUnif-PI ) such that 

0) Ei,* fiVi,t = Ei,* ( u ) Ej,m = Ej,m Cij^ij,*' w C"^ Ei,i,t te W ^ fc • Ei,i,t 

Proof : For each time t, define St = {(i, t) : y^t > 0}. The idea is to simply "spread out" the St sets. Let 
So = U t 5 t be an ordered list where all the (•, t) pairs are listed before any (•, t + 1) pair, and the pairs for 

a given t (i.e., (i,t) € St) are listed in arbitrary order. Let T = J2i t Vit ■ We divide the pairs in Sq into 

To groups as follows. Initialize £ <— 1, S <— So. For a set A of (i, t) pairs, we define the y- weight of A as 

y( A ) = E(i,t) eJ 4 Vi,t, and the %-weight of A as E(i,t)eA If < yO^) ^ then we set G t = S to 
end the grouping process. Otherwise, group Gg includes all pairs of S, taken in order starting from the first 
pair, stopping when the total y- weight of the included pairs becomes at least 1 ; all the included pairs are also 
deleted from 5. If the y-weight of Gg now exceeds 1, then we split the last pair (i, t) into two copies: we 
include the first copy in Gg and retain the second copy in S, and distribute y^ t across the y-weight of the two 
copies so that y{Gg) is now exactly 1. (Thus, the new y-weight of S is precisely its old y-weight —1.) Also, 
for each client j, we distribute Xijj across the x 3 -weight of the two copies arbitrarily while maintaining that 
the Xj -weight of each copy is at most its y-weight. We update £ <— £ + 1, and continue in this fashion with 
the current (i.e., ungrouped) list of pairs S. Note that an (i, t) pair in Sq may be split into at most two copies 
above (that lie in consecutive groups). To avoid notational clutter, we call both these copies (i, t) and use 
yf t to denote the y-weight of the copy in group Gi (which is equal to the original y^t if (i, t) is not split). 
Analogously, we use xfj t to denote the x 3 -weight of the copy of (i, t) in group Gg. 

For every facility i, client j, and £ = 1, . . . , T , we set y' i t = Et:(i,t)€G t Pi,t and x ij,e = E f: (i,t) eG< 
so we have x' ij£ < y' i t It is clear that ^2g y ' i t = Et2/M and Y^e x 'ij,e = Et^ij,* for ever y facility * and 
client j. Thus, (a/, y') is feasible to (lUnif-PI) . and parts (i) and (ii) of the lemma hold. To prove part (iii), 
note that if (some copy of) (i, t) is in Gg, then £ < kt, since then we have (Ur=i G r) C Ut'=i Stf and so 
£ — 1 < kt. Thus, for any client j, we have 

E'( E 4,*)=E E ^kt<E kt E = 

i,£ i,£ t:(i,t)<SGi i,t k(i,t)eG e i,t l:(i,t)eG e i,t 



Proof of part (i) of Theorem 12.1 II : We round the LP-optimal solution (x, y) by using filtering followed by 
standard randomized rounding. Clearly, we may assume that t xy^t = 1 an d y%,t = max, Xij t t for every 
i,j, t. Also, we may assume that if ^ Vt,t+i > 0, then Y^i Vi.t = L because otherwise for some facility i 
and some e > 0, we may decrease y^t+i by e and increase y^t by e, and modify the {xij^+i, Xij,t}j values 
appropriately so as to maintain feasibility, without increasing the total cost. Define Nj = {(i,t) : Cij + t < 
2(C* + L*)} for a client j. The algorithm is as follows. 

Ul. For each (i, t), we set Yn = 1 independently with probability min{41nm • y^t, 1}. 

U2. Considering each client j, if {(i,t) € Nj : Yi± = 1} = 0, then set Yi^\ = 1 where ij is such that 
fi. = miH( i)t ) e jVj /» ( note that 1) ^ N j)- Let 5j = {(*>*) € iVj : = 1} (which is non-empty). 
Assign each client j to the (i, t) pair in Sj with minimum Cjj + t value, i.e., set Xy^ = 1. 
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U3. Let K = maxj £\ Y it . Use Lemma l2.15l to convert (X, Y) into a feasible integer solution to (IUnif-PI ). 



Let Cj and Lj denote respectively the connection cost and latency cost of client j in (X, Y). We argue 
that (i) E[£ M fiY itt ] = O(lnm) • Di,t M*. ® with probability 1, + < 2(C* + L*) for every 
client j, and (iii) If = O(mm) with high probability, and in expectation. The theorem then follows from 
Lemma 12.151 

For any client j, we have J2(i,t)eNj x ij,t > \ ( bv Markov's inequality). Thus, < 2 J2(i,t)eN 3 fiVi,t 
and PrE(j t)eN } ^i,t = 0] * s at most e~ 41nm '2 = l/?n 2 . The expected cost of opening facilities in 
step Ul is clearly at most 4 mm ■ J2itfiVi,t- The expected facility-opening cost in step U2 is at most 

Pr [facility is opened in step U2]-Y, j fij < m -^f 2m Yli,t fiVi,t- So E [Y,i,t fi Y i,t] = 0( lnm )-T,i,t fiVi,t- 
Since we always open some (i, t) pair in Nj, we have Cj + Lj < 2(C? + L*) for every client j. 

Let S = {t : Y,iVi,t > °>- Note that \ S \ < 1 + Ei.t^t ^ 1 + Ej,i,t = m + L After ste P U1 > 
we have E [£V Yi,t\ < 4 In m for every time t G S. Since the Yi t t random variables are independent, we 
also have PrKi^M > 81nm] < 1/m 2 for alH G 5 (and also, E [max ie s Yli ^»,t] = 0(\nm)). Thus, 
after step U2, we have PrEj^,i > 81nm] < 1/m 2 + 1/m and PrEj^,i > 8mm] < 1/m 2 for all 
t E S, t > 1. Hence, Pr[K > 8 mm] < 2/m. (This also shows that E [K] = O(mm).) ■ 

Proof of part (ii) of Theorem 12. 1 1 1 : We round (x, y) by applying filtering [23 followed by Lemma l2.15l to 
reduce the problem to a MSSC problem, and then use the result of Feige et al. lfl~6l to obtain a near-optimal 
solution to this MSSC problem. Let 

min } txjj s.t. ^Jxj,t > 1 V?', x jjt < ^ Vs,t Vj,*, ^ys,t<lVi, x,y>0. (PI) 
j,t t s-.jeS s 

denote the standard LP-relaxation of MSSC |[T6l (here j indexes the elements, S indexes the sets, and t 
indexes time). Feige et al. showed that given a solution (x, y) to (1P1I ). one can obtain in polytime an integer 
solution of cost at most 4 • J2j t 

The rounding algorithm for ZFC MLUFL is as follows. Define Nj = {i : Cij < C*/(l — a)}, so 
Y^ieN h t x i3,t - a - For a11 *>J'*> set Vht = Vi,t/ a > and x ij,t = %ij,t/a if i G Nj and x ij>t = otherwise. 
It is easy to see that (x, y) satisfies £\ < — for all t, and all the other constraints of (IUnif-PI ). We use 
Lemma 12.151 to convert (x, y) to a feasible solution (x 1 , y') to (|Unif-P| ). Next, we extract a solution to (IP1I ) 
from (a;', y'). We identify facility i with the set {j : i G iVj}, and set x^t = X^ejv- x ij t- Now (x, y') is a 
feasible solution to (|P1I) . Finally, we round (x, y') to an integer solution. This yields the ordering y = (y"i,t) 
of the facilities. For each client j, if j is first covered by set i (so i G Nj) at time (or position) t in the MSSC 
solution, then we set Zijj = L 

Analysis. Since a client j is always assigned to a facility in Nj, the connection cost of j is bounded by 
Cj/{1 — a). To bound the latency cost, first we bound the cost of (x, y). Since we modify the assign- 
ment of a client j by transferring weight from farther facilities to nearer ones, it is clear that t GijXijj < 
Y.1.1 c.j-'-ijj. Also, clearly Y,j,i,t t£ ij,t < h ' Y,j,i,t tx ij,t- and Ei^M ^ Thus ' aPP^g Lemma[2T5] 
yields (x',y r ) satisfying J2j.i.t tx ij,t ^ [al a ' Y.j,i,t tx ij,t- The result of dl now implies that 

Ej,i, t tx a,t < 4 Ej, t tx jtt = 4 E hi ,Mj,t < 5 [si ■ 
3 LP-relaxations and algorithms for the minimum-latency problem 

In this section, we consider the minimum-latency (ML) problem and apply our techniques to obtain LP- 
based insights and algorithms for this problem. We give two LP-relaxations for ML with constant integrality 
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gap. The first LP (ILP1I) is a specialization of © to ML, and to bound its integrality gap, we only need the 
fact that the natural subtour elimination LP for TSP has constant integrality gap. The second LP ( |LP2p[ ) 
has exponentially-many variables, one for every path (or tree) of a given length bound, and the separation 
oracle for the dual problem corresponds to an (path- or tree-) orienteering problem. We prove that even a 
bicriteria approximation for the orienteering problem yields an approximation for ML while losing a constant 
factor. (The same relationship also holds between MGL and "group orienteering".) As mentioned in the 
Introduction, we believe that our results shed new light on ML and opens up ML to new venues of attack. 
Moreover, as shown in Section [4] these LP-based techniques can easily be used to handle more general 
variants of ML, e.g., fc-route ML with £ p -norm latency-costs (for which we give the first approximation 
algorithm). We believe that our LP-relaxations are in fact (much) better than what we have accounted for, 
and conjecture that the integrality gap of both (1LP1I ) and ( |LP2p[ ) is at most 3.59, which is the currently best 
known approximation factor for ML. 

Let G = (D U {r}, E) be the complete graph on N = H- 1 nodes with edge weights {d e } that form a 
metric. Let r be the root node at which the path visiting the nodes must originate. We use e to index E and 
j to index the nodes. In both LPs, we have variables Xj t for t > dj r to denote if j is visited at time t (where 
t ranges from 1 to T); for convenience, we think of xj jt as being defined for all t, with xj jt = if dj r > t. 
(As in Section |2~T1 one can move to a polynomial-size LP losing a (1 + e)-factor.) 

A compact LP. As before, we use a variable z e i to denote if e has been traversed by time t. 

min txj t t subject to (LP1) 

i,t 

gj.t > 1 Vj; d e z e>t < t Vi; ^ z e>t > } u x j)V Vt,SCT>,j; x,z>0. 

t e eeS(S) f<t 

Theorem 3.1 The integrality gap of (1LP1I ) is at most 10.78. 

Proof : Let (x, z) be an optimal solution to (ILPlb . and L* = ^2 t txjj- For a € [0, 1], define the a-point 
°f h r j( a )' to be the smallest t such that Ylt'<t x jt' — a - Let D t (a) = {j : Tj(a) < t}. We round (x, z) 
as follows. We pick a S (0, 1] according to the density function q{x) = 2x. At each time t, we utilize the 
| -integrality-gap of the subtour-elimination LP for TSP and the parsimonious property (see |[33l[30l[T8l l6l). 
to round — and obtain a tour on {r} U D t (a) of cost Ct (a) < - ■ J2 e d e z e j < — . We now use Lemma I3T21 
to combine these tours. 

Lemma 3.2 ( II19II paraphrased) Let Touri, . . . , Tour/; be tours containing r, with Tourj having cost Ci and 
containing Ni nodes, where Nq := 1 < N\ < . . . < = N. One can find a subset Tour^ , . . . , Tourj 6= fc 
of tours, and a way of concatenating them that gives total latency at most ^ip Yli Ci(Ni — Ni-±). 

The tours we obtain for the different times are nested (as the D t (a)s are. nested). So J2t>i Ct{a)(\D t (a)\ — 
|A-i(a)|) = J2j I2p.jeD t (a)\D t - 1 (a) C t( a ) = Ej C r 3 { a ){^) < Thus ' usi ng Lemma[IIl and 

taking expectation over a (note that E [ Tj Q ] < 2L*), we obtain that the total latency-cost is at most 
(3.59- 3) ZjL*- ■ 

Note that in the above proof we did not need any procedure to solve A;-MST or its variants, but rather 
just needed the integrality gap for the subtour-elimination LP to be a constant. Also, we can modify the 
rounding procedure to ensure that (latency-cost of j) < 18tj (0.5) for each client j, as follows. (Such a 
guarantee is useful to bound the total cost when its measured as an C p norm for p > 1; see Section |U) 
We now only consider times ti = 2 . Recall that for any a E (0, 1), at each time t, we can obtain a tour 
on {r} U D t (a) of cost Ct(a) < — . We take this tour for tt, and traverse the resulting tour randomly 
clockwise or anticlockwise (this choice can easily be derandomized), and concatenate all these tours. Let 
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£j(a) be the smallest I such that tg, > Tj(a). So the (expected) latency-cost of j is at most 



< 



4.5t 



3t e 

Klj{a) a 



+ 



e illL < 9Tj ^ , Fixing a = 0.5, we obtain a 36-approximation with the per-client guarantee 



that (latency-cost of j) < 18^(0.5) for each j. 



An exponential-size LP: relating the orienteering and latency problems. Let Vt and % denote respec- 
tively the collection of all (simple) paths and trees rooted at r of length at most t. For each path P G Vt, we 
introduce a variable zpj that indicates if P is the path used to visit the nodes with latency-cost at most t. 
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Vt,P€V t (8) 



(9) 



x, z > 0. 

© and © encode that at most one path may be chosen for any time t, and that every node j visited at time 
t' < t must lie on this path. (1LD2I ) is the dual LP with exponentially many constraints. Let (LP2-7-) be the 



analogue of ( |LP2-p[ ) with tree variables, where we have variables zqj for every Q G %, and we replace all 
occurrences of zpj in ( |LP2p[ ) with zq^- 

Separating over the constraints © involves solving a (rooted) path-orienteering problem: for every t, 
given rewards {#j,t}, we want to determine if there is a path P rooted at r of length at most t that gathers 
reward more than j3 t . A (p, 7) -{path, tree} approximation algorithm for the path-orienteering problem is an 
algorithm that always returns a {path, tree} rooted at r of length at most 7(length bound) that gathers reward 
at least (optimum reward) / p. Chekuri et al. ifTll give a (2+e, l)-path approximation algorithm, whereas ifTOl 
design a (1 + e, 1 + e)-tree approximation for orienteering (note that weighted orienteering can be reduced to 
unweighted orienteering with a (l+e)-factor loss). We prove that even a (p, 7)-tree approximation algorithm 
for orienteering can be used to obtain an ©(/^-approximation for ML. First, we show how to compute a 
near-optimal LP-solution. Typically, one argues that, scaling the solution computed by the ellipsoid method 
run on the dual with the approximate separation oracle yields a feasible and near-optimal dual solution, 
and this is then used to obtain a near-optimal primal solution (see, e.g., |[24l ). However, in our case, we 
have negative terms in the dual objective function, which makes our task trickier: if our (unicriteria) p- 
approximate separation oracle determines that (a, j3, 6) is feasible, then although (a, p(3, 9) is feasible to 
( 1LD2I ). one has no guarantee on the value of this dual solution. Instead, the notion of approximation we 
obtain for the primal solution computed involves bounded violation of the constraints. 

Let (LP2^' b) ) be ( |LP2^| ) where we replace V t by V bt , and the RHS of © is now a. Let (LP2^' fe) ) be 



defined analogously. Let OPT-p be the optimal value of ( |LP2-p[ ) (i.e., (LP2^ 
lower bound on the optimum latency. 



). Note that OPT v is a 



Lemma 3.3 Given a {p^)-tree approximation for the orienteering problem, one can compute a feasible 
solution (x,z) to (LP2^' 7 ' ) ) of cost at most OPT. 
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Proof : Lemma [331 Define 



Pfeas(^;a,&) := [(a, (3,6): ©, ©, £0 i)t <& VPgP m , ^ a; - fit > v}, 

j&P j * 

Tfeas(v;a,b) := [(a, 0,0) : 0, @, < Pt VQ € 7^, J>,- - a^ft > i/}. 

Note that OPTp is the largest value of such that Pfeas(z^; 1, 1) is feasible. Given u, (a, (3,6), if there 
was an algorithm to either show (a, (3,6) G Pfeas(z^; 1,1) or exhibit a separating hyperplane, then using 
the ellipsoid method we could optimally solve for OPTp. However, such a separation oracle would solve 
orienteering exactly. 

We use the (p, 7)-tree approximation algorithm to give an approximate separation oracle in the following 
sense. Given u, (a, (3, 6), we either show (a, p(3, 6) G Pfeas(^; 1, 1), or we exhibit a hyperplane separating 
(a, (3,6) and Tfeas(z/; p, 7). Note that Tfeas(z/; p, 7) C Pfeas(^; 1, 1). Thus, for a fixed u, the ellipsoid 
method in polynomial time, either certifies that Tfeas(z^; p, 7) is empty or returns a point (a, (3, 6) with 
(q, p(3, 6) G Pfeas(z^; 1, 1). Let us describe the approximate separation oracle first, and then use the above 
fact to prove the lemma. First, check if ■ aj — p(3 t > u), (O, and © hold, and if not, we use the 
appropriate inequality as the separating hyperplane between (a, (3,6) and Tfeas(^; p, 7). Next, for each t, 
we run the (p, 7) -tree approximation on the orienteering problem specified by (G, {d e }), root r, rewards 
{6j : t}, and budget t. If for some t, we obtain a tree Q G T^t with reward greater than (3 t , then we return 
X^'eQ @j,t — fit as the separating hyperplane. If not, then for all paths P of length at most t in G, we have 
Ejep °j,t < Pfit and thus (a, pfi, 6) G Pfeas(i/; 1, 1). 

We find the largest u* (via binary search) such that the ellipsoid method run for v* with our separation 
oracle returns a solution (a* ,fi* ,6*) with (a*, pfi*, 6*) G Pfeas(^*; 1, 1); hence, we have v* < OPT-p (by 
duality). Now for e > 0, the ellipsoid method run for v* + e terminates in polynomial time certifying the 
infeasibility of Tfeas(^* + e; p, 7). That is, it generates a polynomial number of inequalities of the form f7]), 
©, and (J2j£Q @j,t < fit) where Q G Tyt, which together with the inequality J2j a j ~ PJ2t fit — v * + 6 
constitute an infeasible system. Applying Farkas' lemma, equivalently, we get a polynomial sized solution 
(x, z) to (LP2j?' 7 ^) that has cost at most u* + e. Taking e small enough (something like 1/ exp(input size) 
so that ln(l/e) is still polynomially bounded), this also implies that (x,z) has cost at most u* < OPT p. 
This completes the proof of the lemma. ■ 



Theorem 3.4 (i) A feasible solution (x, z) to (LP2j?' 7 ^) (or the corresponding LP -relaxation for MGL) can 
be rounded to obtain a solution of expected cost at most 0{p^) ■ ^ • t txj^; (ii) A feasible solution (x, z) to 

(LP2p' 7 ^) can be rounded to obtain a solution of cost at most (3.59 • 2)/ry . t txjj. 

Proof : We prove part (i) first. (Note that for ML, the analysis leading to Theorem 13.11 already implies that 
one can obtain a solution of cost at most 10.78/97 t t x j,t, because setting 3 e>t = X^QeT reeQ Z Q' 1 y^l^s 
a solution (x, 3) that satisfies J2 e dele.t < Pit and all other constraints of (1LP1I ).) We sketch a (randomized) 
rounding procedure that also works for MGL and yields improved guarantees. We may assume that each 
z Q,t ^ [0, 1]. At each time ti = 2 , I = |~log 2 T + 4 log 2 m~\ , we select at most \wt e ~\ trees from % te , 
picking each Q G Tjt e with probability zq^ v (We can always do this (efficiently) since for every time t, 



thepolytope {3 G [0, l] r " : EohQ,t < 



J2q zqj } is integral.) We take the union of all these trees. 



Note that expected cost of the resulting subgraph is at most wtlti < pjt^. We "Eulerify" the resulting 
subgraph to obtain a tour for ti of cost at most 2pjti, and concatenate these tours. The probability that 
j is not visited (or covered) by the tour for te is at most 1 — J2t<t e x j,t, which implies that with high 
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probability, we obtain a tour spanning all nodes. Letting tj = £_/(§), the expected latency-cost of j is at 
most 2^(27}- + 2t 3 E fc >o(|) fe ) < WplTj. 

To prove part (ii), we adopt a rounding procedure that again utilizes Lemma 13.21 and yields the stated 
bound (deterministically). Recall that T? (a) denotes the a-point of j. Let D t (a) = {j : Tj(a) < t}. For 
any a € (0, 1), and time t, we now show to obtain a tour spanning D t (a) U {r} of cost at most ^1. We 
can then proceed as in the rounding procedure for Theorem 13.11 to argue that, for the tours we obtain, the 
quantity £V Ci(N{ — Ni-i) appearing in Lemma I3~l2l is bounded by Hence, choosing a as 

before according to the distribution q(x) = 2x and taking expectations, we obtain a solution with the stated 
cost. 

Let K be such that Ka, {Kzp t t}pe-p t are integers. For each P with zpj > 0, we create Kzp t t copies 
of each edge on P, and direct the edges away from r. Let Apj denote the resulting arc-set. Note that 
in At := l+Jp zp >o Ap,t> every node j G T> has in-degree at least its out-degree, and there are Ka arc- 
disjoint paths from r to j for each j G Dt(a). So applying Theorem 2.6 in Bang-Jensen et al. Hi, one can 
obtain Ka arc-disjoint out-arborescences rooted at r, each containing all nodes of D t {a). Thus, if we pick 
the cheapest such arborescence and "Eulerify" it, we obtain a tour spanning D t (a) U {r} of cost at most 

In Appendix lAl we prove an analogue of Lemma l3~31 for MGL. Combined with part (i) of Theorem |LP2y>[ 
this shows that (even) a bicriteria approximation for "group orienteering" yields an approximation for MGL 
while losing a constant factor. 

4 Extensions 

We now consider various well-motivated extensions of MLUFL, and show that our LP-based techniques and 
algorithms are quite versatile and extend with minimal effort to yield approximation guarantees for these 
more general MLUFL problems. Our goal here is to emphasize the flexibility afforded by our LP-based 
techniques, and we have not attempted to optimize the approximation factors. 

Monotone latency-cost functions with bounded growth, and higher C p norms. Consider the general- 
ization of MLUFL, where we have a non-decreasing function A(.) and the latency-cost of client j is given 
by A (time taken to reach the facility serving j); the goal, as before, is to minimize the sum of the facility- 
opening, client-connection, and client-latency costs. Say that A has growth at most p if X(cx) < c p \(x) 
for all x > 0, c > 1. It is not hard to see that for concave A, we obtain the same performance guarantees 
as those obtained in Section |2] (for X(x) = x). So we focus on the case when A is convex, and obtain an 
O (max{(p log 2 n) p ,p log n log m}) -approximation algorithm for convex latency functions of growth p. As 
a corollary, we obtain an 0(plognmax{logn, log m}) -approximation for C p -MLUFL, where we seek to 
minimize the facility-opening cost + client-connection cost + the £ p -norm of client-latencies. 

Theorem 4.1 There is an 0(max{(plog 2 n) p ,plog n log m}^ -approximation algorithm for MLUFL with 
convex monotonic latency functions of growth (at most) p. 

Proof : We highlight the changes to the algorithm and analysis in Section 12.11 We again assume that T = 
poly(m) for convenience. This assumption can be dropped by proceeding as in Theorem I2.8[ we do after 
proving the theorem. The objective of ® now changes to min J2i t fiVi,t + Ylj % t ( c *i + -M*)) x ij,t ■ The only 
change to Algorithm[TJis that we now define tg = min{2^ p , T} and set M := \p log 2 (2 1 / p r max ) + 4 log 2 ml = 
O(plog to). Define Lcostj = ^ t \(t)xijj. Note that we now have A(Tj) < 12Lcostj. Define £j to be the 
first phase £ such that tg > Tj. Let the random variable Pj be as defined in Lemma 1231 The failure probabil- 
ity of the algorithm is again at most 1/ poly(m,). The facility-cost incurred in O(plognlogm) ^ t fiUij, 
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and the connection cost of client j is at most 4C*. We generalize Lemma [231 below to show that E [Lj] = 
0((plog 2 n) p ) A(i^) < 0((2plog 2 n) p ) ■ A(rj), which yields the desired approximation. We have 

Lj < A(£ d ( Tour ^)) ^ A(0(log 2 n) £ i<) < 0(log 2 Pn)A(]T t<), 

«<p,- «<P,- «<P,- 

so E [Lj] < 0(log^ n) [A(E,<^ + E fe >i Prft > 4 + fc]A(E/</ J+ * ■ Now > Pr ^ ^ 4 + ^ 
(D*. E/<^+fc < ^ ' r#W' and ( 2VP " !) ^ ¥ ■ Plu ^ in 8 these in § ives ' 

4\ k 



E[Lj] = O(log^n) • _ ■ \(t e .) • 2 fc = 0((p/]n2yiog*n)A(^). ■ 

Removing the assumption T = poly (m) in Theorem 14. 11 As in the case of Theorem l2.8[ to drop the as- 
sumption that T = poly(m), we (a) solve the LP considering only times in TS = {To, . . . , T&} (where T r = 
[(1 + e) r ] ); and (b) set t t = JS(L-2 £ /p) and the number of phases to TV := \p log 2 (2 1 / p r max /L) + 4 log 2 m] , 
where L = (52j,i,t tx ij,t)/ m - Note that TV = 0(p\ogm), and A(L) < (52j,i,t H t ) x ij,t)/ rn = (52 j Lcost^/m, 
which shows that the expected latency cost incurred for clients j with Tj < to is at most 52 j Lcostj . 

Corollary 4.2 One can obtain an O (p log n maxjlog n, log m}) -approximation algorithm for C P -MLUFL 

Proof : As is standard, we enumerate all possible values of the £ p -norm of the optimal client latencies in 
powers of 2, losing potentially another factor of 2 in the approximation factor. To avoid getting into issues 
about estimating the solution-cost for a given guess (since our algorithms are randomized), we proceed as 
follows. For a given guess Lat, we solve (0 modifying the objective to be min Ej t fiVi,t + 52 j % t c ij x ij,t, 
and we adding the constraint 52j it^ Px ij,t ^ Lat p . Among all such guesses and corresponding optimal 
solutions, let (x, y, z) be the solution that minimizes Ej t fiVi,t + 52 j i t c ij x ij,t + Lat. Let OPT denote 
this minimum value. Note that OPT < 20* , where O* is the optimum value of the £p-MLUFL instance. 
We apply Theorem 14. 1 1 (with A(x) = x p ) to round (x, y, z). Let F, C, and L = Ej Lj, denote respectively 
the (random) facility-opening, connection-, and latency- cost of the resulting solution. The bounds in Theo- 
rem O imply that E [(Ej £y) 1/p ] < ( E [Ej L j]) 1/P < 0(p log 2 n) Lat, which combined with the bounds 
on E [F] and E [C] , shows that the expected total cost is O (p log n maxjlog n, log m}) OPT. ■ 

We obtain significantly improved guarantees for related MLUFL, metric uniform MLUFL, and ML with 
(convex) latency functions of growth p. For related MLUFL and ML, the analyses in Sections 12.21 and l3l 
directly yield an 0[2°^) -approximation guarantee since for both problems, we can (deterministically) 
bound the delay of client j by 0(a-point of (for suitable a). For metric uniform MLUFL, we obtain 
an 0(l)-approximation bound as a consequence of Theorem 12.121 this follows because one can devise 
an 0(l)-approximation algorithm for the zero-facility-cost version of the problem by adapting the ideas 
used in 0. Thus, we obtain an 0(l)-approximation for the £ p -versions of related-MLUFL, metric uniform 
MLUFL, and ML. 

£>route MLUFL with length bounds. All our algorithms easily generalize to k-route length-bounded 
MLUFL, where we are given a budget B and we may use (at most) k paths stalling at r of (d-) length at 
most B to traverse the open facilities and activate them. This captures the scenario where one can use k 
vehicles in parallel, each with capacity B, starting at the root depot to activate the open facilities. Observe 
that with B = oo, we obtain a generalization of the k-traveling repairmen problem considered in |[T4l . 

We modify © by setting T = B and setting the RHS of ® to kt. In Algorithm [Q we now obtain a 
tour Tour^ in phase £ of (expected length) 0(log 2 n)kt£. Each facility i € Tour^ satisfies 52t<t Vi>t > 0, 
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so d(i, r) < ti, and we may therefore divide T01117 into k tours of length at most 0(log 2 n)tg. Thus, we 
obtain the same guarantee on the expected cost incurred, and we violate the budget by an 0(log 2 n) -factor, 
that is, we get a bicriteria (polylog, 0(log 2 n)) -approximation. Similarly, for related MLUFL and ML, we 
obtain an (O(l), 0(1)) -approximation. For ML, we may again use either (ILPlb or ( |LP27?[ ): in both LPs 
we set T = B; in (ILPlb . we now have the constraint J2 e deZ e ,t < kt, and in ( |LP2p[ ), the RHS of (O is 
now k. For metric uniform MLUFL, we modify (lUnif-PI ) in the obvious way: we now have J2t Vit — ^ f° r 
each time t, and t now ranges from 1 to B. We can again apply Theorem l2. 12l here to obtain a (unicriteria) 
0(l)-approximation algorithm: for the zero-facility-location problem (where we may now "open" at most 
kB facilities), we can adapt the ideas in Q to devise an 0(l)-approximation algorithm. 

Finally, these guarantees extend to latency functions of bounded growth (in the same way that guar- 
antees for MLUFL extend to the setting with latency functions). Thus, in particular, we obtain an O(l)- 
approximation algorithm for the £ p -norm ^-traveling repairmen problem; this is the first approximation 
guarantee for this problem. 

Non-uniform latency costs. We consider here the setting where each client j has a (possibly different) 
time-to-cost conversion factor Xj, which measures j's sensitivity to time delay (vs. connection cost); so the 
latency cost of a client j is now given by Xjtj, where tj is the delay faced by the facility serving j. 

All our guarantees in Sections 12.21 12.31 and [3] continue to hold in this non-uniform latency setting. 
In particular, we obtain a constant approximation guarantee for related metric MLUFL, metric uniform 
MLUFL, and ML. Notice that the metric uniform MLUFL problem cannot now be solved via a reduction 
to the metric-UFL variant discussed in Section 1231 however we can still use Corollary 12. 131 to obtain a 
10.773-approximation. For general MLUFL, it is not hard to see that our analysis goes through under the 
the assumption T = poly (to). However, the scaling trick used to bypass this assumption leads to an extra 
0(log(y 31 ^)) factor in the approximation. 

Recall that the scaling factor L (in the definition of te) in Section I2TT1 was defined as Lj/m, where 

L* = Y^i t t t x ij,t- Now, L* = J2i,t ^j tx ij,t> and we set L = Ylj V 0ne can a 8 ain ar S ue tnat tne 

expected latency-cost of each client j is at most Xj ; -0(log 2 n) -ma^L, Tj}, so we incur an 0(log 2 n)-factor 
in the latency-cost. The number of phases, however, is M := [log 2 (2r max /L) + 41og 2 m\ , and 2r max /L = 

O ( m ^ r i 3 ) < 0( m x max ), which gives an extra log 2 (A max /A m i n ) factor in the facility-opening cost. 
Finally, as before, these guarantees also translate to the A; -route length-bounded versions of our problems. 
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A Proofs omitted from the main body 

Proof of Theorem H33 : Consider a GST instance (H = (V, E),r, {d e } eeE , {Gj C V}f =1 ). Let n = \V\. 
We may assume that H is the complete graph, d is a metric, and the groups are disjoint. We abbreivate p n , m 
to p below. Let Q denote the collection of all solutions to the path-variant of GST; that is, Q consists of 
all paths starting at r that visit at least one node of each group. For a path Q € Q and a group Gj, define 
Qj to be the portion of Q from r to the first node of Gj lying on Q. Let dj(Q) = YleeQj be the length 
of Qj) that is, dj(Q) is the latency of group j along path Q. Consider the following LP-relaxation for the 
path- variant of GST, and its dual. We have a variable xq for every Q € Q indicating if path Q is chosen. 
We use Q below to index the paths in Q. 
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(IF1 has an exponential number of variables. But observe that separating over the constraints (fTOb in the 
dual involves solving an MGL problem. The minimum value (over all Q £ Q) of the LHS of (PTOl) is the 
optimal value of the MGL problem defined by {{Gj, \j)}, where we seek to minimize the weighted sum of 
client latency costs. (This weighted group latency problem can be reduced to the unweighted problem by 
"creating" Xj copies of each group Gj (we can scale the Xjs so that they are integral); equivalently (instead 
of explicitly creating copies), one can simulate this copying-process in whatever algorithm one uses for 
(unweighted) MGL.) Thus, a p-approximation algorithm for MGL yields a ^-approximate separation oracle 
for (ID' b - Now, applying an argument similar to the one used by Jain et al. [24] shows that one can use this 
to (also) obtain a p-approximate solution (x, M) to dPl ). 

We now use randomized rounding to round (x, M) and obtain a group Steiner tree of cost at most 
0(logm)M. We pick path Q independently with probability min{41ogm • xq, 1}. Let Q! C Q denote 
the collection of paths picked. Note that for every group Gj, we have ^QGQ d J (Q)<2A/ X Q — \- ^° a 
standard set-cover argument shows that with probability at least 1 — 1/m, for every j, there is some path 
qw £ Q> S uch that dj(Q^) < 2M. We may assume that Y1q x Q = 1> so Chernoff bounds show 
that \Q'\ = O (log to) with overwhelming probability. The group Steiner tree T consists of the union of 
all the Qj (sub)paths (deleting edges to remove cycles as necessary). Clearly, the cost of / is at most 
| Q! | • 2M = 0(log m)M. Note that T also yields a path of length 0(log m)M starting at r and visiting all 
groups, so the integrality gap of (IP 7 !) is 0(log m). ■ 



Extension of Lemma 1331 to MGL. Notice that we did not use anything specific to the minimum-latency 
problem in the proof, and so essentially the same proof also applies to MGL. Recall that in MGL, we have 
a set F of facilities and a d-metric on F U {r}, and a collection of m groups {Gj C F}. Analogous to 

(LP2p' 6 ^), the LP-relaxation with path variables for MGL and its dual are as follows. 
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The LP-relaxation (LP'^' ) with tree variables is obtained by replacing Vu with Tu in (LP' 



(a,b) 
V 



The 

orienteering problem that we need to solve now to separate over the constraints (fTTT> is group orienteering: 
given a reward 9j )t for each group Gj, we want to determine if there is a path (or tree) rooted at r of length at 
most bt such that the total reward of the groups covered by it is more than f3 t - Given these changes, the proof 
that one can obtain a feasible solution (x, y) to (LP'^' 6 ^) of cost at most the optimal value of (LP'^' 1 ^) is 
as in the proof of Lemma 1331 
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